A biologically plausible model of human planning based on neural networks and Dyna-PI models
نویسنده
چکیده
Understanding the neural structures and physiological mechanisms underlying human planning is a difficult challenge. In fact it is the product of a sophisticated network of different brain components that interact in complex ways. However, some data produced by brain imaging, neuroanatomical and neurophysiological research, are now beginning to make it possible to draw a first approximate picture of this network. This paper proposes such a picture in the form of a neural-network computational model inspired by the Dyna-PI models (Sutton, 1990). The model is based on the actor-critic reinforcement learning model, that has been shown to be a good representation of the anatomy and functioning of the basal ganglia. It is also based on a “predictor”, a network capable of predicting the sensorial consequences of actions, that may correspond to the lateral cerebellum-prefrontal and rostral premotor cortex pathways. All these neural structures have been shown to be involved in human planning by functional brain-imaging research. The model has been tested with an animat engaged with a landmark navigation task. In accordance with the brain imaging data, the simulations show that with repeated practice performing the task, the complex planning processes, and the activity of the neural structures underlying them, fade away and leave the routine control of action to lower-level reactive components. The simulations also show the biological advantages offered by planning and some interesting properties of the processing of “mental images”, based on neural networks, during planning. On the machine learning side, the model presented extends the Dyna-PI models with two important novelties: a “matcher” for the self-generation of a reward signal in correspondence to any possible goal, and an algorithm that focuses the exploration of the model of the world around important states and allows the animat to decide when planning and when acting on the basis of a measure of its “confidence”. The paper also offers a wide collection of references on the addressed issues.
منابع مشابه
Planning with neural networks and reinforcement learning
planning with neural networks, time limits of discounted reinforcement learning Planning, taskability, Dyna-PI architectures Dyna-PI architectures: focussing, forward and backward planning, acting and (re)planning. Tested with... Ideas from problem solving and
متن کاملImproving the Izhikevich Model Based on Rat Basolateral Amygdala and Hippocampus Neurons, and Recognizing Their Possible Firing Patterns
Introduction: Identifying the potential firing patterns following different brain regions under normal and abnormal conditions increases our understanding of events at the level of neural interactions in the brain. Furthermore, it is important to be capable of modeling the potential neural activities to build precise artificial neural networks. The Izhikevich model is one of the simplest biolog...
متن کاملDaily Pan Evaporation Estimation Using Artificial Neural Network-based Models
Accurate estimation of evaporation is important for design, planning and operation of water systems. In arid zones where water resources are scarce, the estimation of this loss becomes more interesting in the planning and management of irrigation practices. This paper investigates the ability of artificial neural networks (ANNs) technique to improve the accuracy of daily evaporation estimation....
متن کاملEffects of Combination of Intravenous Anesthetic Agents on Human Glycine Receptor evaluated by Artificial Neural Networks
Introduction: Intravenous general anesthetic agents are among the most important and widely used anesthetic drugs in the clinical practice. Many pharmacological studies have shown that potentiation of GABA and glycine on their receptors is the most plausible mechanism. Nevertheless, there is limited information on the effects of co-administration of two or more of these agents. However, expe...
متن کاملارائه مدلی جهت پیش بینی بیماری دیابت با استفاده از شبکه عصبی
Introduction: Meta-heuristic and combined algorithms have a great capability in modelling medical decision making. This study used neural networks in order to predict Type 2 Diabetes (T2D) among high risk individuals. Methods: This study was an applied research. Data from 545 individuals (diabetic and non-diabetic), in Diabetes Clinic of Hamedan University of Medical Sciences, we...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002